# Speech recognition fine-tuning
Wav2vec2 Large Xlsr 53 English Pronunciation Evaluation Aod Cut Balance
Apache-2.0
English pronunciation assessment model based on wav2vec2-large-xlsr-53 for evaluating English pronunciation quality
Audio Classification
Transformers

W
hafidikhsan
35
5
Neunit Ks
Apache-2.0
A speech processing model fine-tuned based on facebook/wav2vec2-base, with an accuracy of 28.57%
Speech Recognition
Transformers

N
SHENMU007
23
0
Wav2vec2 Nsc Final 2 Google Colab
Apache-2.0
A speech processing model fine-tuned based on facebook/wav2vec2-base, specific purpose not clearly stated
Speech Recognition
Transformers

W
YuanWellspring
14
0
Wav2vec2 Base Librispeech Demo Colab
Apache-2.0
This model is a speech recognition model fine-tuned on the LibriSpeech dataset based on facebook/wav2vec2-base, suitable for English speech-to-text tasks.
Speech Recognition
Transformers

W
khanhnguyen
24
0
Part1
Apache-2.0
This model is a fine-tuned speech processing model based on facebook/wav2vec2-base, with no specific use case explicitly stated
Speech Recognition
Transformers

P
zasheza
28
0
Wav2vec2 Base Toy Train Data Random Low Pass
Apache-2.0
This model is a speech recognition model fine-tuned on an unknown dataset based on facebook/wav2vec2-base, primarily used for Automatic Speech Recognition (ASR) tasks.
Speech Recognition
Transformers

W
scasutt
29
0
Wav2vec2 Large Xlsr 53 Toy Train Data Masked Audio 10ms
Apache-2.0
Speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, optimized on 10ms audio masked training data
Speech Recognition
Transformers

W
scasutt
22
0
Wav2vec2 Base Toy Train Data Masked Audio 10ms
Apache-2.0
A speech recognition model fine-tuned based on facebook/wav2vec2-base, trained on 10ms masked audio tasks
Speech Recognition
Transformers

W
scasutt
22
0
Wav2vec2 Base Toy Train Data Augment 0.1
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-base, trained on a toy dataset with 0.1 ratio data augmentation applied
Speech Recognition
Transformers

W
scasutt
22
0
Wav2vec2 Large Xlsr 53 Toy Train Data Augment 0.1.csv
Apache-2.0
This model is a speech recognition model fine-tuned from facebook/wav2vec2-base, trained using data augmentation techniques
Speech Recognition
Transformers

W
scasutt
22
0
Wav2vec2 Base Toy Train Data Augment 0.1.csv
Apache-2.0
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base, utilizing data augmentation techniques (augmentation ratio of 0.1).
Speech Recognition
Transformers

W
scasutt
21
0
Sew D Small 100k Ft Timit
Apache-2.0
An automatic speech recognition model fine-tuned on the TIMIT_ASR dataset based on asapp/sew-d-small-100k
Speech Recognition
Transformers

S
patrickvonplaten
18
0
Sew D Small 100k Timit
Apache-2.0
This model is an automatic speech recognition model fine-tuned from asapp/sew-d-small-100k on the TIMIT_ASR - NA dataset, achieving a word error rate of 0.8061 on the evaluation set.
Speech Recognition
Transformers

S
patrickvonplaten
16
0
Featured Recommended AI Models